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• • . 1 INTRODUCTION 



ABSTRACT 

Near future cosmology will see the advent of wide area photometric galaxy surveys, 
like the Dark Energy Survey (DES), that extent to high redshifts {z ^ 1 — 2) but with 
poor radial distance resolution. In such cases splitting the data into redshift bins and 
using the angular correlation function w{9), or the d power spectrum, will become 
the standard approach to extract cosmological information or to study the nature 
of dark energy through the Baryon Acoustic Oscillations (BAO) probe. In this work 
we present a detailed model for ■w{9) at large scales as a function of redshift and 
bin width, including all relevant effects, namely nonlinear gravitational clustering, 
bias, redshift space distortions and photo-z uncertainties. We also present a model for 
the full covariance matrix characterizing the angular correlation measurements, that 
takes into account the same effects as for w{0) and also the possibility of a shot- noise 
component and partial sky coverage. Provided with a large volume N-body simulation 
from the MICE collaboration we built several ensembles of mock redshift bins with a 
sky coverage and depth typical of forthcoming photometric surveys. The model for the 
angular correlation and the one for the covariance matrix agree remarkably well with 
the mock measurements in all configurations. The prospects for a full shape analysis 
of 'w{9) at BAO scales in forthcoming photometric surveys such as DES arc thus very 
encouraging. 



The statistical analysis of the distribution of structure at 
large astronomical scales has played a key role in advancing 
the field of Cosmology over the last 20 years. From shap- 
ing our understanding of complex processes driving galaxy 
formation and evolution to constraining the energy density 
content of the Universe. 

The completion of large extra-galactic surveys such at 
the Sloan Digital Sky Survey (SDSS, York et al. 2000) 
and the 2dF Galaxy Redshift Survey (2dFGRS, CoUess 
et al. 2003) have bolstered our general knowledge in the 
field. Particularly more so when combined with the precise 
measurements of the Cosmic Microwave Background or the 
increasingly reach data from Supernova data (Sanchez et 
al. 2009; Percival et al. 2010; Reid et al. 2010; Komatsu 
et al. 2010) . One of the most promising, and eventually 
rewarding, challenges for the field of large scale structure to- 
day is the prospect for determining what drives the late time 
acceleration of the Universe (Riess et al. 1998; Perlmutter 
et al. 1999). This is probed by the presence, in the cluster- 
ing pattern of galaxies, of remanent features from the cou- 



pling of baryon and photons prior to recombination known 
as the Baryon Acoustic Oscillations (BAO). The BAO have 
already been detected in the spectroscopic samples of Lumi- 
nous Red Galaxies (LRGs) in both SDSS and 2dFGRS (Cole 
et al. 2005; Eisenstein et al. 2005), and studied in the early 
imagining data of SDSS (Padmanabhan et al. 2007). 

But the observational quest has only started. Several of 
the next-generation surveys will gain in area and depth, in 
exchange for a poorer determination of radial positions. In 
turn this imposes the need for angular clustering analysis in 
redshift bins of width few times larger that of the photomet- 
ric error uncertainty at the given redshifts. The difficulty lies 
in that the projection in redshift bins lowers the clustering 
amplitude, erasing any particular feature and increasing the 
noise-to-signal ratio. The achievable precision of our pho- 
tometrically estimated redshift will play a crucial role. We 
thus need to understand what affects the angular clustering 
pattern more severely. 

The aim of our work is to tackle this problem, pro- 
viding a well calibrated model for the clustering signal at 
large-scales as a function of angle, radial distance and bin 
width, deepening the available literature in the subject (e.g. 
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Padmanabhan et al. 2007, Blake et al. 2007 and references 
therein). We put particular effort in stressing the most rel- 
evant effects, redshift distortions and photo-z uncertainties, 
and how they interplay. 

An equally important problem is to have the capability 
of estimating the full errors of the measurements. We thus 
provide a well tested description of the complex error matrix 
characterizing the measurements of the correlation function 
in real situations, i.e. including effects of partial sky cover- 
age, photo-z, redshift distortions, bias and shot-noise. 

Both, the model for the correlation and the one for the 
error matrix, will be extensively tested against a very rich 
set of mocks redshift bins. This work should therefore be 
relevant for ongoing projects that use photometric redshift 
estimates like the Dark Energy SurvejQ (DES), the Physics 
of the Accelerating Universe collaboratior(3 (PAU) and the 
the Panoramic Survey Telescope and Rapid Response Sys- 
tenlfl (PanStarrs). But also for upcoming imaging proposals 
such as the Large Synoptic Survey Telescopqj (LSST) and 
the ESA/Eucliclj survey. 

This paper is organized as follows. In Sec. [5] and Sec. [3] 
we discuss the models proposed in this work for the angular 
correlation function and its full error matrix respectively. 
In Sec. 0] we describe the set of ensembles of mock redshift 
bins implemented using a large volume N-body simulation. 
In Sec. [5] and [6] we test the models against the mocks, un- 
der different regimes and assumptions. Section [7] contains 
our conclusions and future lines of research. We also include 
several appendixes. In Appendix |X] we give a description of 
our model for the 3-d nonlinear matter correlation function. 
In Appendix [Bl we study the limitations of the widely used 
Limber approximation. Finally, Appendix [C] gives a brief 
note on the covariance of the angular power spectrum in- 
duced by partial sky coverage. 



where 6 is the angle between directions n and h + 9, related 
to the pair-separation through 

ri2(e) = {r{z^f + r{z2f - 2r{z^)r{z2) cos(6l)}''" (3) 
and r{z) is the co-moving distance to redshift z given by 



r{z) 



H{u) 



(4) 



where H{z)/Ho = + z)^ + QoEil + is the 

Hubble parameter, Q,m and $1de = 1 — are the matter 
and dark energy densities respectively, and w is the dark 
energy equation of state 0. Since we are interested in red- 
shift bins and not in extended selections we can neglect the 
growth evolution within the bin and simply evaluate the 3-d 
correlation in some fiducial redshift z (e.g. the mean red- 
shift of the bin, weighted by </>). In addition we will assume 
a local and linear bias relation between fluctuations in the 
tracer (e.g. galaxies) and matter density field, Sg = b{z)5 
(see Sec. 15.21 for a justification). Under these assumptions 
Eq. ^ is converted to 



dzi /(^i) / dz2 f{z2) ar{zi),r{z2), 6, z) (5) 



w{e) 



where f{z) = b[z) <j){z) and ^ is the matter 3-d correlation 
function. 

Hence, in order to predict ?ii(S) we need a model for the 
spatial clustering accurate in a sufficiently large range of 
scales to allow the projection in Eq. ((5]), in particular when 
photo-z errors broadens the extent of the radial distribution 
(see (Sanchez et al. 2010) for a empirical parametric fit 
suited for BAO scales). 

In what follows we discuss how to include photo-z effects 
and the model for spatial clustering in real and redshift space 
that we will use throughout this paper. 



2 AN ANALYTIC MODEL FOR THE 

ANGULAR CORRELATION FUNCTION 

Let us start by considering the projection of the spatial 
galaxy fluctuations 5g (x, z) along a given direction in the 
sky n 



dz(j>{z)Sg{n,z), 



(1) 



where (j> is the radial selection function. The angular corre- 
lation function is then obtained as a simple projection of the 
3-d correlation function ^ (Peebles 1973), 

w{e) = (5,(n)5,(n + e)) = 

= J dzi<l>{zi) J dz2(l}{z2)^gg{r{zi),r{z2),e) (2) 



^ www . darkenergysurvey . org 
^ www . pausurvey . org 

pan-stars.ifa.hawaii.edu 

www.lsst.org 
^ www.euclid-imaging.net 



2.1 Photo-z 

We incorporate the way uncertainties in the true redshift po- 
sitions obtained from photometric estimates affect angular 
clustering by means of the radial selection function, follow- 
ing Budavari et al. 2003 (see also Ma et al. 2006). 

The radial selection is the probability to include a 
galaxy in our redshift bin. If the selection of objects is done 
according to their true redshifts, then (p is equal to the true 
number of galaxies per unit redshift times a window function 
W encoding selection characteristics (e.g. redshift cuts). 



{z) = ^W{z). 



(6) 



Instead, if the selection is done according to photomet- 
ric redshift estimates, one must incorporate the probability 
P{z\zp) for the true redshift to be z when the photomet- 
ric one is Zp. The ending result is the product (Budavari et 
al. 2003), 



dNg 

dz 



dzpP{z\Zp)W{Zp), 



(7) 



^ These expressions explicitly assume a fiat cosmology and con- 
stant ui, for more general cases see (Matsubara 2004) and refer- 
ences therein 
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where W{zp) is the photometric redshift window function. 
Throughout this paper we wiU only consider top-hat win- 
dow functions both in true and photometric redshifts (i.e. 
W = 1 within a given redshift range, and otherwise). In 
addition we will only consider the idealized CEise where the 
photometric estimate is Gaussianly distributed around the 
true redshift (e.g. Ma et al. 2006). Although this might 
be far from reality, it serves as an interesting starting point 
for more realistic scenarios (Hearin et al. 2010; Bernstein & 
Huterer 2010). Lastly, we recall that (f> should be normalized 
to unity within the redshift range of interest. 

2.2 Spatial clustering and redshift evolution 

We now turn into the discussion of the 3-d matter corre- 
lation model accounting for nonlinear gravitational effects, 
redshift space distortions, and the way we evolve it with red- 
shift. We postpone to Appendix |A] the testing of this model 
against measurements of 3-d matter clustering in N-body 
simulations. 

The linear evolution of the clustering pattern preserves 
its shape but increases the overall amplitude. The main ef- 
fects due to nonlinear gravitational clustering at large scales 
are a smoothing of the BAO wiggles and a rise in clustering 
amplitude above linear values towards smaller scales due to 
mode-coupling effects (Seo & Eisenstein 2005; Eisenstein et 
al. 2007; Crocce and Scoccimarro 2008). Although these 
processes can be modeled from first principles (Crocce and 
Scoccimarro 2006a; Matarrese & Pietroni 2008; Matsub- 
ara 2008; Taruya et al. 2009), it is also possible and de- 
sirable to find simpler parametric approximations. In the 
correlation function these two effects can be parameterized 
as (Crocce and Scoccimarro 2008), 

i{r) = KLin(r) ® e-e-Z^-^'^Kr) + ^^,CS.(OCLin(r) (8) 

where S(,ao and Amc are fitting parameters, ^Lin is the linear 
correlation function at the given redshift, ^l;,-; its derivative 
and 

e[ii = 47r j Pi^i4k,z)ji{kr)kdk. (9) 

In Eq. (|8} , the symbol (8 denotes a convolution. This model 
have been already used in the analysis of matter, halo and 
galaxy clustering (Sanchez et al. 2008; Sanchez et al. 2009). 

However the standard approach for analyzing cluster- 
ing data in a photometric redshift survey covering from low 
{z ~ 0.2) to high redshift (z ~ 1.4) is to divide the data into 
several redshift bins (whose minimum width are ultimately 
determined by the photo-z accuracy, e.g. Padmanabhan et 
al. 2007). If one then performs a joint analysis of all these 
bins it is desirable to have the least number of nuisance pa- 
rameters possible in order to optimize constraints on derived 
cosmological parameters. From this point of view it is inter- 
esting to investigate to what extent a single set of best-fit 
parameters can be used to describe the 3-d clustering from 
low to high redshift, and hence the angular clustering after 
the projection in Eq. ([5]). 

We implement this as follows. The first term in Eq. (|8]) is 
proportional to the linear correlation, therefore it scales with 



the growth factor squared ~ D^{z). The second term arises 
from leading order mode-mode coupling and thus scales as 
~ D'^{z). In turn the damping of BAO is proportional to 
the amplitude of large-scale velocity flows, Eq. (|A1|) . and so 
Sbao ~ D{z). Putting these considerations together we scale 
our parametric model with redshift as, 

ar,z) = i3'(z)[SLin,o(0®e~*''^'"'^'''°°''] W 

+ Ar.,D^{z)^iljr)^i,^^o{r) (10) 

where sub-script means (linear) quantities evaluated at 
z — 0. The values for Sbao and Amc can be taken from a 
best-fit analysis to ^(r) at any given redshift (or to ■w{0) 
in any given redshift bin, after the projection in Eq. ©). 
In our case will be those from the best-fit at z = 0.3, 
Sbao ~ 6.37/i~'^Mpc and Amc = 1.55. This is detailed in 
Appendix [X] where we present a detailed comparison of our 
model against numerical simulations, with particular em- 
phasis on the scaling introduced in Eq. (|10|l . 

Lastly, we move to the inclusion of redshift space dis- 
tortions. The true distance to a galaxy differs from the one 
derived from its redshift through the Hubble law because of 
the radial peculiar velocity of the galaxy on top of the Hub- 
ble flow (Kaiser 1987). At large scales, the coherent infall of 
galaxies into large overdensities, such as clusters, make their 
observed radial separation smaller, squashing the structure 
along the line-of-sight and boosting the amplitude of the 3- 
d two point correlation. In this way for separations along 
the line of sight tt ^ 40 h^^ Mpc the correlation (or number 
of pairs) increases dramatically, while for larger separations 
the correlation becomes negative in such a way that the to- 
tal number of pairs along the l.o.s is preserved (e.g. Fig. Al 
in Gaztanaga et al. 2009). This implies that, by dividing 
the data in redshift bins, one is discarding the leverage of 
large radial separations effectively increasing the (angular) 
correlation within the bin (see Nock et al. 2010 for a recent 
and detailed discussion of this effect). 

The linear redshift distortions discussed above, namely 
the Kaiser effect, can be easily described assuming the 
plane-parallel approximation. We will incorporate it into 
our modeling of the angular correlation function by writ- 
ing ^(n, r2) = 5(cr, tt) in Eq. (gj, with (Hamilton 1992), 

^(a,^) =Co(s)Po(M)+e2(s)P2(M) + e4(s)P4(M), (11) 

where tt — r2 — ri and — 2rir2(l — cos ^) (to yield s = ri2) 
are the pair-separation along and transverse to the line-of- 
sight, /X = tt/s and Pi are the Legendre polynomials. The 
double integrals in Eq. ([2| are still performed in the ri, r2, 
variables leaving the evaluation of the radial selection func- 
tions unchanged. The monopoles of the anisotropic correla- 
tion ^i(s) are given in Appendix |B2] fEgs. [B"10IB11IB12'|| in 
terms of the 3-d monopole correlation ^(s), that we will take 
as the one including nonlinear gravitational effects given by 
Eq. (|10|l . In Appendix IB2I we discuss this effect in more de- 
tail, also in the context of the Limber approximation. 
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3 ANALYTICAL MODELING OF THE 
COVARIANCE MATRIX OF W{e) 

An equally important aspect to the understanding of the sig- 
nal in clustering analysis of galaxy surveys is the capability 
to estimate the corresponding errors in the measurements. 
This is of particular importance for analysis that use cor- 
relation functions in Configuration space because the mea- 
surements are highly correlated. 

Notably there is scarce work in the literature aiming at 
developing analytical estimates of the covariance matrix of 
angular correlation functions besides the early work of Bern- 
stein 1994, who developed an error estimate for the Landy 
& Szalay estimator in terms of higher order correlations. 

Most "data analysis" papers have relied on sub- 
sampling techniques of the data itself , such as jack-knife, 
bootstrap and field-to-field variations (e.g. Ross et al. 2007; 
Meneux et al. 2009; Sawangwit et at. 2009). However as 
noted in the comprehensive work of Norberg et al. 2009 
all these approaches have failings, at least in 3-d clustering, 
depending on the way they are implemented and the regime 
of scales of interest. On the other hand, projection along 
the line-of-sight alleviates this tension leading to a good 
agreement with theoretical estimates, as shown by Cabre et 
al. 2007 in the context of cross-correlations between galaxy 
and CMB maps. 

In what follows we try to revert the lack of analytical 
work provided that we are interested on large angular scales, 
where nonlinear (i.e. non-Gaussian) effects are weaker and 
that we know how to model the signal itself. We thus concen- 
trate in discussing how to model expected errors in angular 
clustering, including the effects of sampling variance, shot- 
noise, partial sky coverage, photo-z and redshift distortions. 
We put particular emphasis on the description of the full 
error matrix, and not only the diagonal components, and 
leave for further work the assessment of possible systematic 
effects. 

Let us start by decomposing the fluctuations in the 
number of objects "per pixel" in the sky into spherical co- 
ordinates (Peebles 1973), 

t 

^i^) = Y. Y. a-tmYi^{ii), (12) 

where n is the line-of-sight direction and the spheri- 
cal harmonics. The coefficients in this expansion form the 
angular power spectrum, 

{aemai'm') = ^U'Smm'Ct (13) 

that can be related to the angular correlation function using 
the Addition theorem [3| yielding, 

H0) = J2(^^)P^(^°'(^)C^ (14) 

where Pi are the Legendre polynomials of degree £. The 
covariance in the measurements of w{6) can then be related 



to those in Ci as, 

/2/ + 1 \ ^ 

Coves' = (^^) Pi{cos 6) Pe, {cose') Coy It' (15) 

where 

CoYeo' = {w{e)a{e')), Cov«, = {CiCi,), (i6) 

and w{6) and Ci denote the estimators used for w{6) and Ci 
respectively. In a full sky situation, and assuming the 
spectra are Gaussianly distributed, the Ce measurements 
are uncorrelated, Cov«/ = Vax{Ci)5u' ■ In addition, one can 
estimate each I power using the 21+1 available modes, 

= ^ E (17) 

thus, Var(C£) — 2Cf/{2l + 1). Replacing these relations 
back into Eq. ((T3J leads to the final expression for Covggi in 
a full sky survey. 

However a more realistic and interesting scenario is one 
in which the sky coverage is partial. In Cabre et al. 2007 it 
was shown, using Gaussian realizations of the airn spectra, 
that errors in configurations space scale as 1/ \J ]sky (which, 
in turn, is the scaling of the available number of harmonic 
modes). In what follows we will assume this scaling, and 
compute the covariance matrix as (Dodelson 2003; Cabre 
et al. 2007) 

2 2/ -I- 1 

Covee- = ^Y. -7-r^Peicose)Pi{cose') {Ci + 1/nf 
Jsky i47rj 

(18) 

where we have also included the standard shot-noise con- 
tribution arising in the variance of the d estimates (Pee- 
bles 1973) (n is the number of objects per steradian). We 
remark that the assumption leading to Eq. psp is not that 
the Ce covariance remains diagonal in a partial sky survey 
but instead that Cov(6', 8') can be obtained from its full sky 
expression by the scaling 1/ fsky We discuss this further in 
Appendix [Cl 

To proceed further we thus need a model for the angu- 
lar spectra. In real space the Ce spectra are given by (see 
Appendix IX)) 

Cf.Exact = ^ / 47rA:'dfcP(fc)*|(fc) (19) 

where P{k) is the real space matter power spectrum and, 

*^(fc) = J dz(t>{z)D{z)je{kr{z)). (20) 

Throughout this paper we will use the linear theory power 
spectrum in Eq. (|19|) . We have tested that the inclusion of 
small scale nonlinear effects (or the damping of the baryon 
acoustic features) have no impact in our predictions for the 
errors at the (large) angular scales we are interested in. 

Redshift space distortions are accounted for by follow- 
ing the same procedure that leads to the Ce expression in 
Eq. (|19|) but starting from a power spectrum that includes 
the Kaiser effect discussed in Sec. I2.2l and Eg. IB8I The final 
result is simply the following additive contribution to the 
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kernel in Eq. (Padmanabhan et al. 2007) , 



*f (fc) = P j dz(j){z)D[z) 



{2f 



21-1) 



(2^- 1)(2^ + 1) 



jt-2{kr 



(2^ + 3)(2^- 
(^+ l)(^ + 2) 



-je(kr) 



(21) 



where r = r[z). In turn, photo-z effects are incfuded tlirougli 
the radial selection function (l>{z), as discussed in Sec. 12.11 

Notice however that the expressions in Eqs. (|19l20l2ip 
are numerically expensive to evaluate due to the oscillatory 
behavior of jt{x) for a: 1. In Appendix IB II we discuss 
our own approach to perform these integrals, valid at large 
scales and involving the natural cut-off stao in Eq. ([8]). 



4 N-BODY SIMULATION AND MOCK 
SURVEY CATALOGUES 

As discussed previously, we aim at developing and test- 
ing analytical expressions for the signal, variance and co- 
variance of the angular correlation function w(S) against 
measurements in simulated upcoming photometric surveys. 

Hence, in order to have a robust statistics but at the 
same time be representative of such future surveys we used 
partitions of a very large N-body simulation, provided by 
the MICE coUaboratiorjj, to build survey mocks. The sim- 
ulation, named MICE7680, tracked the gravitational evo- 
lution of 2048'^ dark-matter particles within an unprece- 
dented comoving volume of Lbox = 7680 /i^^ Mpc using 
the Gadget-2 code (Springel 2005). Initial conditions were 
set at Zi — 150 using the Zeldovich dynamics and assum- 
ing a flat LCDM cosmology with parameters Qrn = 0.25, 

= 0.75, fib = 0.044 and h = 0.7. The spectral tilt was 
set to Us — 0.95 and the initial amplitude of fluctuations set 
to yield as = 0.8 a,t z = 0. The resulting particle mass was 
3.65 X 10^^ Mq (see Fosalba et al. 2008 and Crocce et 
al. 2010 for further details). 

Without loss of generality we next assumed a survey 
covering a continuous 5000 deg^ of sky (i.e. a sky fraction 
/sky = 1/8), and redshift coverage in the range 0.2 < z < 
1.4. In broad terms, this matches the specifications for DES. 

In turn, the fact that redshifts are estimated photo- 
metrically implies that much of the "small-scales" radial 
information is lost. In such scenario the best approach is 
to study angular clustering in redshift bins of width larger 
or comparable to the mean photo-z error (e.g. Simpson et 
al. 2009). We thus built photo-z survey mocks by extracting 
spherical shells of varying width from "comoving" outputs 
of MICE7680. Each shell is restricted to span only one oc- 
tant in angular size and its radius f{z) is set to match the 
comoving distance to the redshift of the output. 

To place the spherical shells within the simulation box- 
size we defined observers in a regular grid and set the num- 
ber of grid-points in each direction A''^ in order to have none 
or minimal volume overlap between different mocks (while 
placing the shells in the positive octant of the observer). 
Strictly non-overlaping mocks can be achieved by setting the 



spacing along two cartesian axis equal to f+Ar/2, while the 
third to V2fAr. Thus we defined as the integer part of 
Lbox/y/2fA, while Nx and Ny as the round-off of f -I- Ar/2 
to its nearest integer 0. Notice that rounding-off instead of 
taking the integer part could lead to a sligth volume overlap 
in those directions. However we have explicitly checked that 
volume overlap never exceeds 1% in any of the mocks. 

Note that the very large size of MICE7680 is a critical 
point in order to have a robust statistics in the whole redshift 
range of the assumed survey. 

In order to have a clearer understanding of the different 
components of the model we built ensembles of mocks in 
increasing "layers of reality" . We first selected dark-matter 
particles directly from real space assuming a radial distri- 
bution as expected in DES. We also tested, and dismissed, 
effects due to biased tracers by repeating this exercise start- 
ing from halo catalogues. Next we moved particles to redshift 
space before doing the selection. Alternatively, we imposed 
a random radial uncertainty in the position of each parti- 
cle before selection to mimic photometric error. Finally we 
imposed the radial distribution in addition to redshift dis- 
tortions and photometric error to build mocks which are 
closest to a real survey. 

Once the particles were selected we built angular num- 
ber density maps in the Healpix format with Ngide = 256 
(Gorski et al. 19990 This Ns^de corresponds to 98304 
pixels in 1/8 of sky with an angular resolution of 13.75 
arc-min. In this way, and given our mass resolution rUp — 
3.65 X 10^^/i"^Mq, we obtained mocks with ~ 0.1 — 0.5 
galaxies per squared arc-min (depending on the bin-width) 
in the low redshift bins {z — 0.3) and ~ 1 — 4 galaxies per 
squared arc-min at high redshift {z = 1.1). Once the den- 
sity field was pixelized we measured the correlation function 
using the standard estimator (Barriga & Gaztafiaga 2002; 
Eriksen et al. 2002) 



(22) 



jhttp: //www. ice . cat/mic e j 



where Si = (n^ — {n))/{n) is the density fluctuations in the 
i-th pixel and Npair is the number of pixel pairs. 

In what follows we give a more detailed discussion of 
the different cases considered, while Table [T] contains a de- 
scriptive summary of the flnal suite of mock ensembles used 
throughout this paper. 



4.1 Real Space Mocks 

Let us start by describing our mocks in real (or conflgu- 
ration) space. To fairly sample the range 0.2 < z < 1.4 
we select comoving outputs of MICE7680 at the redshifts 
z = 0.3, 0.5, 0.73 and 1.1. In each output we extract all 
the particles within spherical shells of radius equal to the 



^ Except when we include redshift distortions and/or photomet- 
ric error where take Nx = Ny = and round off ii,oa;/''(2*) 
instead, with 2+ the redshift at which the true redshift distribu- 
tion in Eq. Q is negligible (see Fig. [2]| 
http://healpix.jpl.nasa.gov 
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comoving distance to the given redshift. In turn, to be rep- 
resentative of typical photo-z errors we set 4 different bin 
width for each z, namely Az/{l + z) = 0.03, 0.05, 0.1, 0.15. 
The comparison of these 16 mock configurations with the 
approximate (Jz{z) expected for DES (Banerji et al. 2008) 
is shown in Fig. [T] 

Translated to comoving distance these bins range from 
100 to 500 Mpc in width. These spherical shells are re- 
stricted to have right ascension and declination in 0° — 90° , 
therefore covering 1/8 of sky. Lastly, by randomly select- 
ing particles within each bin we impose the following radial 
distribution 

dN/dz oc (2/0.5)^ exp [-(2/0.5)'''] , (23) 

which is also what is expected in DES (we thank DES LSS 
working group for providing this). 

The resulting number of mocks depends on z and 
A2;/(l -I- z), but it ranges from tens to thousands, making 
these set of ensembles very suitable for error studies as well 
as for testing models and systematics. The top panel of Ta- 
ble [T] includes the main characteristics for 11 of these bins, 
which are the ones that for concreteness we focus on in this 
paper, although our conclusions extend to the full set. 



4.2 Real Space Mocks for biased tracers 

In order to study differential features in the angular cluster- 
ing of biased tracers as compared to that of dark-matter we 
also built mocks starting from halo catalogues. 

We concentrated in two characteristic redshifts and 
built mocks in exactly the same manner as described in 
Sec. 14.11 We note however that from the numerical point 
of view it is very hard to resolve halos of lO^^"'^^ in 
a volume as large as the one we are considering here, ~ 450 
cubic Gpc/h. Thus, to be able to reproduce the mass-scale 
of LRG halos we choose poorly resolved halos (or groups) as 
tracers. Note that, although poorly resolved, these groups 
have the same clustering amplitude and abundance of real 
LRG galaxies (Cabre and Gaztanaga 2009). 

At z = 0.3 we consider groups of M > 10^^ h-^ M© 
(i.e. 5 or more particles) and bin width A2/(l -f 2) = 
0.15. The spatial abundance of these tracers is fi = 1.7 x 

At 2 = 0.5 we consider halos with M > 2 x 10^^ M© 
(8 or more particles) with n = 0.49 x 10"* /i^ Mpc~^. 
With these selections we try to mimic LRG halos of M > 
10" /i-^Mq. At 2 = 0.5 we also consider cluster mass-scale 
halos of M > 10^* Mq (35 particles or more), their 
abundance given by n = 0.64 x 10"^ Mpc~=^. 

These mocks are summarized in the second panel of 
Table [1] We do not consider bins at higher redshifts, as the 
galaxy bias is expected to be closer to linear and local. In 
addition, since the linear local bias model holds valid to the 
extent we are able to investigate, see Sec. 15.21 we will next 
concentrate on the effects of photo-z and redshift distortions 
on the matter field itself. 



Real Space Mocks 
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0.3 
0.3 
0.3 


0.03 
0.10 
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0.11 
0.33 
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0.5 
0.5 
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0.05 
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0.15 


1345.8 
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Real Space Halo Mocks 
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Redshift Space Mocks 
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0.06 
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0.5 


0.05 
0.15 
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0.06 
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125 
125 





Table 1. Suite of Mock catalogues. Each mock consist of a red- 
shift shell subtending 1/8 of sky at a radial comoving distance 
and width as listed in the top panel. All mocks corresponds to 
dark-matter particles (except real space halo mocks) with a radial 
distribution given in Eq. l|23|l . The mean distance f and width Ar 
arc in Mpc. The surface density n in particles per square arc- 
min (and for a given bin is similar for real, redshift and photo-z 
space). Halo- masses are in Mq and D is the "linear" growth 
factor. The photo-z error cr^ equals 140 Mpc for our cosmol- 
ogy. The bias reported was obtained from the ratio of angular 
correlation functions (see Sec. 15.21 also for associated error bars). 
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4.3 Redshift Space Mocks 

To understand the importance of redshift distortions, and 
the accuracy of the modeling, we built mocks where we im- 
pose the radial distribution in Eq. (|23p but displace the par- 
ticles to redshift space prior to the top-hat selection. 

We concentrated in the comoving output of MICE7680 
at z = 0.5 since this redshift is the typical mean z for up- 
coming photometric surveys such as DBS or PanStarrs. We 
then identified redshift bins of width A/{l + z) — 0.05 (thin) 
and A/{l + z) = 0.15 (thick). In each redshift shell the map- 
ping from real r to Redshift Space positions s is given by the 
transformation 



' + vr{l + z)/H(z)r 



(24) 



where H is the Hubble parameter and Vr the peculiar veloc- 
ity of the object along the line of sight from the observer. 
Therefore given the observer at position ro (the center of the 
sphericall redshift shell) we first find the particle's projected 
velocity along the l.o.s. to the observer, 



-ro 



(25) 



then displace it by Sr = Vr{l + z)/H{z) along the l.o.s. 

, (r - ro) 



5v = 5r- 



ro 



(26) 



and finally do the top-hat selection. It total, we built 125 
mocks subtending one octant of angular size for each of these 
two bin widths (see Table [l]). 
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Figure 1. Mocks configurations We built several ensembles of 
mock redshift bins covering 1/8 of sky (5000 sq deg) and with 
mean redshifts z = 0.3, 0.5, 0.73, 1.1 (shown by the inset ver- 
tical arrows). For each z we set four different redshift width 
i^z/{l+z) = 0.03, 0.05, 0.1, 0.15 (dashed lines, bottom to top re- 
spectively). A more detailed description of the ensemble of mocks 
for each configuration is given in Table I. The aim is to resem- 
ble with high statistical accuracy the geometry of large area and 
deep photometry galaxy surveys, such as the Dark Energy Sur- 
vey (DES). The expected photo-z error in DES (using griz bands) 
is shown by the long dashed line, while the solid line shows the 
resulting photo-z after adding the JHKs filters from the Vista 
Hemisphere Survey (from Banerji et al. 2008). 



4.4 Photo-z Space Mocks 

Sets of mocks including (Gaussian) Photo-z errors were also 
produced in almost the same manner, except that the dis- 
placement along the l.o.s was random with a probability 

2a? 



f{5r) 



2'Ka, 



exp 



(27) 



where = azc/H{z), and is the survey photometric 
uncertainty at the given z. We only considered az = 0.06, 
which is the nominal value for DES a.t z — 0.5 using the 
griz photometric bands (Banerji et al. 2008) as reproduced 
our in Fig. [l] This is also the expected photo-z accuracy for 
the Stt all-sky survey of the Pan-Starrs collaboration (Cai et. 
al 2009) at this redshift using grizy bands alone. In addition, 
it is the approximately photo-z precision obtained for the 
optical sample of LRGs selected from the SDSS imaging data 
(Padmanabhan et al. 2005; Padmanabhan et al. 2007). This 
is thus a very representative value for a^. For our cosmology 
it translates to an uncertainty in the radial distance of ~ 
140 Mpc (see Table[T]for details). Once the photometric 
error was added we selected the particles using top-hat cuts 
in photometric redshift of width Az/{1 + z) = 0.05 and 0.15. 

The net effect of selecting a sample according to their 
photometric redshift, e.g. with a top-hat criteria, is to have a 
broader distribution of true redshifts of the selected galaxies. 
This is depicted in Fig. [2]where the solid line corresponds to 
the standard top-hat selection in true redshift as done in the 



previous mocks. Dashed line shows instead the true distribu- 
tion of objects (i.e. as a function of their true redshifts) that 
in our photo-z mocks entered the (top-hat) photo-z bins of 
Az/(l + 2) = 0.15 (left panel) and Azl{\ + z) = 0.05 (right 
panel). 

These radial selection functions were obtained from 
Eq. ([7]) using the underlying distribution from Eq. (|23p 
(shown by the solid line) and a Gaussian P(z\zp') of width 
Oz = 0.06, Eq. (|27p . and have been normalized to unity 
when integrated over redshift. Their importance lies in the 
fact that this is all one needs in order to compute the model 
correlation function, as discussed in Sec. (2] In addition, no- 
tice that we have chosen mock bin widths such that Az ~ 
(i.e. a "narrow bin" of width comparable to the photo-z) or 
Az ~ 4(T2 (a "broad bin"). 



4.5 "Survey" Mocks : dNdz, RSD and Photo-z 

Finally we built mocks that include all the aforementioned 
effects: a realistic radial distribution Eq. 1231 redshift distor- 
tions and photometric redshift with uncertainty a^, — 0.06. 
We again concentrated in the mean redshift z = 0.5 and 
two bins of width Az/{1 + z) = 0.05 ~ ("narrow") and 
Az/{1 + z) = 0.15 4 X az ("wide"). Therefore, these 
are the closest to an actual photo-z survey such as DES 
or PanStarrs. 
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Figure 2. Radial distribution of galaxies vs. true redshift in our 
redshift distortions and photo-z mocks. Solid lines are the radial 
distribution of objects for a top-hat selection in true redshift of 
width A2/(l + z) = 0.15 (left panel) and A2/(l + z) = 0.05 
(right panel), both centered at z = 0.5 and assuming a constant 
spatial density. Dashed lines shows instead the distribution of 
objects as a function of true redshift if the same top-hat selection 
is now done in photometric redshifts. In this later case a Gaussian 
photometric redshift error of Oz = 0.06 is assumed. 



Before moving on we want to stress that our mocks 
are obtained from comoving outputs of an N-body simula- 
tion and thus contain aU correlations induced by non-linear 
gravitational evolution as well as projection effects, partial 
sky coverage and realistic radial selection function. In addi- 
tion some account for photo-z and/or RSD effects. However, 
since we do not use light-cone outputs they do not include 
evolutionary effects of the sample within the redshift bin. We 
argue that, for the narrow bins under consideration, light- 
cone effects are negligible in front of photo-z and RSD (in 
the same way nonlinear gravitational effects are, see Sec [5]). 
To leading order, light-cone effects will introduce a linear 
evolution, which can be estimated from Eq. ((5]) by weighting 
the selection by the corresponding linear growth (i.e. defined 
with respect to the mean redshfit), / — D{z, z) b{z)(f){z). By 
doing this, we have found no difference with the same cal- 
culation that instead evaluates the growth a the mean red- 
shift of the bin. Nonetheless, this is certainly an interesting 
subjet that needs to be addressed more properly (i.e. with 
light-cone vs comoving outputs). We leave that for further 
work. 



tion function. The resulting correlations are shown in Fig. [3] 
compared with measurements in the mocks at the 4 differ- 
ent mean redshifts, z — 0.3, 0.5, 0.73, 1.1 (top to bottom). 
In each case for a bin width of the size of the typical photo- 
metric error achievable at the given redshift in a survey like 
DES (Banerji et al. 2008). 

The agreement between our theoretical model and the 
mean of the measurements is excellent for all configurations 
tested, see Table [T] in particular those shown in Fig. O And 
we recall that we are using only two parameters obtained 
from a best-fit to ^ at z = 0.3 o Hence the evolution with 
redshift is a component of the model. In each case displayed 
error bars correspond to the error on the mean of the en- 
semble (i.e. a / \^ Nmocks) , that given the large number of 
ensemble member we count on is remarkably small. 

The importance of nonlinear effects in front of projec- 
tion ones are minor if we consider the large error bars achiev- 
able in one single mock measurement. It is however encour- 
aging that one is able to model accurately a large range of 
angular scales, given the mixing of all the distance scales 
involved in the redshift bin projection. We note that the 
second term in Eq. ^ does not impact the shape and po- 
sition of the BAO, but it does bring theory and mocks in 
better agreement for smaller angular separations. 

One interesting result would be to have an estimate of 
the minimal angle the model is able to reproduce. A conser- 
vative approach to investigate this is to match the smaller 
comoving scale involved in the projection of a galaxy pair 
subtending an angle 8 within a redshift bin (where rmin{z) 
is the lower limit of the redshift range). 



rs = rmin (z) (2 — 2 cos 6^) 



1/2 



(28) 



with the minimum scale one is capable of modeling in the 
3-d clustering. 

Interestingly we have found that, if we define r„i as the 
scale where the model departs from the data by some fixed 
percentage, then our minimum scale satisfies. 



r„i{z) ~ r„i{z = 0) X D{z) 



(29) 



and rni{0) ~ 20/i"^Mpc for a 15% error in This en- 
sures the following minimal angles above which the model 
should perform well for the 4 cases shown in Fig. |3] 6m.in ~ 
1.3°, 0.7°, 0.5°, 0.4°, in agreement with Fig. O But again, 
this is a conservative limit because the relative contribution 
of scales ~ to the full redshift bin projection is minor. 



5 MODEL VS. MOCKS I : THE 
CORRELATION SIGNAL 

In Sec. [2] we discussed in detail our model for the angular 
correlation function. We now proceed to show how it per- 
forms against clustering measurements in the ensembles of 
mock redshift bins described in Sec. [l] 

5.1 Nonlinear Gravity and evolution 

Using the model for ^(r, z) in Eq. (|10|) we now project into 
redshift bins according to Eq. ((S]) to find the angular correla- 



5.2 Biased tracers 

We now revisit to what extent the possible presence of scale 
dependent bias in the spatial correlation function of tracers 
(Smith et al. 2008; Manera and Gazatanaga 2009; Des- 
jacques and Sheth 2010; Desjacques et al. 2010) translates 
into the angular correlation function, depending on the bin- 
width and mean redshift. Or equivalently, to what extent 
the assumption of linear bias holds in the angular clustering 

The level of matching does not change if we use instead the 
theoretical expectations for these parameter discussed in Sec. 12.21 



© 0000 RAS, MNRAS 000, 000-000 



Modeling the angular correlation function and its full covariance in Photometric Galaxy Surveys 9 



03 -3 



Real Space 




z = 0.3, Az/(l-z) = 0.1 



2 3 4 5 6 7 





= 0.73, A7/(l I z) = 0.1 



2 3 



-1 




z = 1.1, Az/(l + z) = 0.15 
0„.„ = 2.5-- Xi/l„ ~6 



of halos. For this exercise we thus use the suite of mocks 
described in Sec. 14.21 

The top panel of Fig. |4] shows the ratio of correlation 
functions (i.e. the bias) measured in the 392 mocks of z = 0.3 
and Az/{1 + z) ^ 0.15 for halos M > 10^^ Mq. Middle 
and bottom panels shows the same ratio but from the 175 
mocks bins a.t z = 0.5 and Az/{1 + z) = 0.1 for masses 
M > 2 X 10^^ h-^ Mq and M > lO" h'^ Mq respectively. 
For reference the vertical blue arrow shows the position of 
the BAO peak in each case. 

In all cases the bias is scale independent well within 
error bars (corresponding to the standard deviation of the 
mean of the ensemble) and at the 2 — 3% level for the cases 
mimicking galaxy clustering (M ~ 10^^ M©). For cluster 
mass-scale the shot-noise of the sample is much larger and 
consequently so the error bars. Nonetheless, there is not clear 
tendency with scale. We recall that our galaxy-type halos 
are poorly resolved. However we have found that increasing 
the resolution (i.e. the minimum number of particles per 
halo) and so the typical halo-mass, does not alter the scale 
independence of the bias shown in the top panels of Fig. [4] 

The error bars displayed in the top and middle panels 
of Fig. Q were obtained from the r.m.s of the mocks and 
correspond to the mean of the ensemble. 

For cluster mass-scale halos (bottom panel) it was not 
possible to obtain the bias ratio [whhiS) / w{9)Y^'^ for every 
single mock and angular separation due to the large shot- 
noise dominated errors that led to Whh < in some cases. 
We have then estimated the bias as {{whh} / (w))^^'^ and the 
errors propagating the r.m.s. ensemble errors in the halo 
angular auto-correlation Whh and the matter angular auto- 
correlation w as, 



Sb/b = (1/2) [{Swhh/whhf + (Sw/w 



,21 1/2 



(30) 



and converting to errors on the mean by Sb — >■ Sb/^/Nmocks- 
We have tested using the lower mass-scale halos that this 
approach of the purely ensemble average leads to the same 
bias and errors. 

In summary, we conclude that there is no evidence of 
scale dependent bias for these tracers within the error bars. 



5.3 Redshift Distortions and Photo-z 



e[degj 

Figure 3. Angular Correlation Function measured in 4 ensem- 
bles of mocks z-bins in Real Space compared with our nonlinear 
model (solid blue) and linear theory (red dashed line). Error bars 

correspond to the uncertainty in the mean, the actual r.m.s. is 

1/2 

^mock larger (which is specified in the inset label). We have ex- 
plored many more configurations than those shown (see details in 
Table[l]and Scc.lijJ and the same level of agreement was found. 



Let us now discuss the impact of redshift distortions and 
photometric errors in w{6). Figure [5] corresponds to the an- 
gular correlation measured in the mocks in configuration 
space from Sec. 14.11 (middle green symbols), redshift space 
from Sec. 14.31 (top red symbols) and photo-z space from 
Sec. 14.41 (low blue symbols). In solid green, red and blue 
lines we show the corresponding analytical predictions ob- 
tained from Eqs. (|5l7ll0IB8p . Black dashed line and symbols 
depict the model and measurements when accounting for all 
effects at once, with mocks introduced in Sec. 14.51 For red- 
shift distortions we used that fi = f{z = 0.5) = 0.7047 for 
our cosmology (since & = 1) while for photo-z we used the 
selection functions shown in Fig. [2] 

Error bars displayed in Fig. [5] are true ensemble errors 
corresponding to the standard deviation of the mean (i.e. 
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Figure 4. Large Scale Halo Bias (Real Space), in the angular 
correlation function for bins at z = 0.3 (top panel) and z = 
0.5 (middle and bottom). The bias is linear within few percent, 
and well within error bars. Except perhaps in the lower panel, 
that corresponds to cluster-scale mass halos with large errors fully 
dominated by their low abundance. See text for more details. 



a / \/ N,nocks , where a is the r.m.s variance of w{8) measure- 
ments). They increase from ~ 1% - 2% at 2° to ~ 3% - 4% 
at 4° (the angular BAO scale). Hence, Fig. [S] clearly shows 
that the model described in this paper performs remarkably 
well for both thin and thick bins, in configuration, redshift, 
and /or photo-z spaces. In redshift space it tends to under- 
estimate the measurements for < 3° at the ~ 2% level. 

Redshift space distortions produce a strong enhance- 
ment of the clustering signal (see (Nock et al. 2010) who 
find similar results for the projected correlation). This effect 
is also a strong function of scale, becoming more important 
for larger separations. Indeed, it increases the amplitude of 
the BAO bump by as much as a factor of 2.3 for the thin 
bin of A = 0.05(1 + z) ^ 180 h'^ Mpc (top panel of Fig. [5]) 
and 1.5 for the thick one of A = 0.15(1 + z) ^ 530 /i"^ Mpc 
(bottom panel of Fig. O . 

In turn, photo-z errors have the opposite effect to that 
of redshift distortions, decreasing the overall amplitude of 
angular correlations. This is simply because the projection 
in Eq. (O extends over a much larger range of scales (e.g. 
see Fig. [2]). Notably, this effect is not far from being scale 
independent. For the thin bin the impact is a bit stronger 
(top panel of Fig. [SJ. If we only consider photo-z effects, the 
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Figure 5. Redshift Distortions vs. Photo-z effects: We show the 
mean correlation over 125 mock measurements, and the corre- 
sponding predictions, in configuration (middle green line and 
symbols), redshift (top red line and symbols) and photo-z space 
(low blue line and symbols). Black line and symbols corresponds 
to mocks measurements and model when all effects are combined 
together. Redshift distortions (RD) depend on the parameter 
/3 = f/b = 0.7047 in our cosmology, while photo-z errors were 
set to (T^ = 0.06. Hence, top panel corresponds to a bin of width 
Az ~ while bottom to Az ^ Acr^. RD induces a strong and 
scale dependent enhancement of the correlation, counteracted by 
the smearing due to the photo-z uncertainties. Error bars shown 
correspond to the standard deviation of the mean of the ensemble. 



amplitude of w{9) decreases by ~ 60% when going from true 
to photometric redshifts (green to blue lines in the figure). 
In the case of a wide bin (bottom panel in Fig. [5]), although 
its width is approximately 4 times the photo-z there is still 
a reduction in amplitude of ~ 30%. 

The total angular correlation, including redshift distor- 
tions, photo-z, nonlinear evolution and bin projection effects 
is depicted by the dashed line in Fig. [5l In the case of bin 
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width comparable to the photo-z error the effect of photo-z 
dominates over redshift distortions (even though the impact 
of redshift distortions is larger for thinner bins). For a wider 
bin (bottom panel) the conjunction of effects leaves the am- 
plitude of correlation at the BAO peak almost unchanged, 
but it does introduce a strong scale dependent bias with re- 
spect to the real space clustering. Notably, the inclusion of 
redshift distortions to the w{9) in photo-z space enhances 
the amplitude by up to 50% at the peak position. Clearly, 
the appropriate inclusion of these contributions can be cru- 
cial in the analysis of real data. 

We note that the above conclusions assume an unbiased 
tracer. For biased ones the impact of redshift distortions, 
sensitive to f/b, is smaller. For instance, if we take b = 1.7 
at z — 0.5, that is a characteristic value for optically selected 
LRGs (Padmanabhan et al. 2007; Ross et al. 2007), but 
keep the same a, we find the amplitude of 'w{9) at the BAO 
peak position {9 ~ 4°) reduced by ~ 10% with respect to 
the b = 1 case, for a bin A ~ 4 x (but still gives 30% 
boost with respect to the case where redshift distortions 
is neglected all together). This, of course, might or not be 
accompanied by a corresponding change in the amplitude of 
errors depending on whether the galaxy sample is shot-noise 
dominated or not. 



5.4 Dependence on redshift distribution 

The very good match shown so far between the model corre- 
lation and the mock measurements relies in the fact that we 
have a perfect knowledge of the true redshift distribution. 
However when analyzing real data this can only be true to a 
certain extent. This is particularly so for photometric data, 
where different photo-z codes assign different redshift esti- 
mates to the same galaxies and thus could lead to slightly 
distinct redshift distributions in each redshift bin (e.g. Ab- 
dalla et al. 2008 and references therein). To understand 
the extent by which the angular correlation is affected by 
this potential unknown we first fitted the redshift distribu- 
tion for our narrow top-hat bin in photo-z space (assuming 
az = 0.06) with a normal distribution, 

^ exp[-{z - fif /2a'^]. (31) 



2Tva 

We found that fj. — 0.512 and a — 0.0628 leads to an excel- 
lent agreement between the analytic expression in Eq. (|3ip 
and the exact distribution shown by a dashed line the the 
right panel of Fig. (2] We next increased /i by 3% and sep- 
arately a by 10% and re-computed the angular correlation. 
These off-sets are similar to the variations recently found by 
Thomas et al. 2010 when analyzing the photometric sample 
of LRGs in SDSS with 6 different photo-z codes, and select- 
ing top-hat redshift bins comparable to the one discussed 
here {z — 0.5, Az/{1 + z) = 0.05). As a third example we al- 
lowed the center of the redshfit distribution to vary as much 
as the assumed photo-z, to /i = 0.572. The resulting correla- 
tions are shown in Fig. [6] We include also the measurements 
in the corresponding photo-z mocks (already presented by 
black symbols in top panel of Fig. [5]). Error bars shown are 
the r.m.s variance over the ensemble (rather than variance 
on the mean) to be representative of a real survey situation. 
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Figure 6. Dependence of w{d) on the redshift distribution. Top 
panel: Solid blue line is the correlation from the "true" distri- 
bution modeled as a gaussian of mean = 0.512 and variance 
a = 0.0628. Other lines are the model correlation from distribu- 
tions slightly off this one, as indicated in the labels. Also shown 
are mock measurements with errors corresponding to a 5000 dcg^ 
photometric survey with nominal photo-z (Tz = 0.06. Bottom 
panel: ratio of the different w(6) to the "true" case with /i = 0.512 
and cr = 0.0628. Varying the mean by 3% and 10% leads to shifts 
in the location of the BAO feature by approximately the same 
relative amounts (solid and dashed black lines respectively), in 
addition to changes in the amplitude. Increasing the variance by 
10% lowers the correlation by ~ 10% and introduces a scale de- 
pendent modulation of the amplitude but does not affect the peak 
(solid red line). 



As noticeable in the figure variations in /i and cr lead to 
analogous variations in wip). Incrementing the width of the 
distribution by ~ 10% reduces the amplitude of the correla- 
tion function by about 10% and introduces a scale dependent 
bias across all the angular range (see bottom panel). In turn, 
displacements in the mean redshift of the radial distribution 
shifts the angular position of the BAO peak by comparable 
percentage amounts (this is more evident when shifting /i 
by 10%, see dashed line in Fig|S}, in addition to the ampli- 
tude. If we instead lower fi or a the effects persist but are 
reversed. 

The main conclusion is that a poor knowledge of the 
true distribution (e.g. from uncertain photometric esti- 
mates) can lead to serious systematic effects depending on 
the statistical errors of the survey under consideration. 



THE ERROR 



MODEL VS. MOCKS II 
MATRIX 



We now move to test the performance of Eq. p8p in evalu- 
ating the full covariance matrix in 'w{9) measurements. We 
first discuss the diagonal component, or variance, in Sec. 16.11 
and then the reduced covariance in Sec. 16.21 
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6.1 Comparing the wiO) variance 

In Fig. [7] we show the r.m.s variance resulting from wiO) 
measurements in several ensembles of mock redshift bins 
in Real Space (top panels corresponds to narrow bin cases, 
bottom to their wide counterpart). The bin selection was 
top-hat in true redshift. The total number of mocks in each 
ensemble depends on the particular value of mean redshift 
and width of the bin (as detailed in Table [l]), but they are 
mostly over few hundreds, thus giving a unique statistical 
framework for our analysis. Solid lines in Fig. [7] corresponds 
to the prediction for the error, 

\l/2 



Reduced Covariance - Mocks 



Reduced Covariance - Theory 



Aw{e) = Cov(6',f 



(32) 



from Eq. (O using either Cf,Exact (solid blue) or C«,Limbor 
(solid red). The agreement between the theory and mocks is 
remarkably good for all the range in 6 of interest for large 
scale structure studies, in particular BAO, and all bin con- 
figurations. As we move to higher redshifts / wider bins 
the statistics becomes slightly poorer, but the agreement 
is still evident. In turn, the Limber approximation over- 
estimates the error by as much 30% for thin bins but it 
rapidly converges to the exact result for bins wider than 
~ 200 h'^ Mpc (e.g. the cases z = - A2/(l + z) = 0.15 
ov z = 0.5 — Az/(1 + z) = 0.15, at the bottom left panels). 

Let us now discuss the prediction of errors including 
photo-z effects and redshift space distortions. The left panel 
of Fig. [8] shows the r.m.s variance in the 125 mocks that 
incorporate photometric redshift uncertainties as described 
in Sec. 12.11 The mean redshift was z — 0.5 and the photo-z 
error assumed was = 0.06 (Gaussianly distributed). Note 
that in this case, the selection was top-hat in photo-z space. 
The true redshift distribution of objects used in modeling of 
Ce (as detailed in Eqs. (|19I20[) and Appendix IBlf) is given 
in Fig. H 

We considered two characteristic photometric redshift 
bin widths, one comparable to the photo-z error (A — 
0.075), and one almost 4 times larger (A = 0.225). In both 
cases the prediction in Eqs. H19j|20p . shown by solid blue and 
dashed red lines, is in very good agreement with the mock 
measurements (in empty and filled symbols). 

The middle panel of Fig.[H]shows instead the r.m.s error 
obtained from the same bins in redshift space. The predic- 
tion, in Eqs. (|19ll20ll2T1l . works to good accuracy in this case 
as well, for both wide and thin redshift widths. Notice that in 
the cases shown in this section the error is "sampling vari- 
ance" dominated (i.e. negligible shot- noise), and therefore 
is expected to scale with the signal itself. This is reflected 
in Figs. [7] and [8] where in photo-z space the r.m.s error is 
smaller than in real space, and viceversa for redshift space. 
In addition, there are some wiggly features in the measure- 
ments not reproduced by the model. These, we believe, are 
due to an insufficient number of mocks (as they also show 
up in Fig. [7] for ensembles with similar number of mocks). 

The right panel of Fig. [8]show the error measured in our 
most realistic 125 mocks including both photo-z and redshift 
distortions. They resemble (in amplitude) those in photo-z 
space indicating that the spreading of the galaxy distribu- 
tion due to photometric uncertainties is the dominant effect 
in front of redshift distortions for these redshift bins. 
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Figure 9. Measured Reduced Covariance Matrix (top left) vs. 
the Model in Eq. (18\ } (top rigfit) for the bin centered at z = 0.5 
and width Az/{1 + z) = 0.05 in Real Space. Bottom panel shows 
their ratio. Notably, the agreement is within 5% for most of the 
reduced covariance. The largest differences arise in areas far from 
the diagonal where the reduced covariance is close to zero anyway. 



6.2 Comparing the m(S) reduced covariance 

Once the diagonal error is well modeled we can turn to the 
prediction of the full (reduced) covariance matrix. We will 
concentrate in mocks that are "sampling variance" domi- 
nated, in real and redshift plus photo-z space and leave the 
case of "shot-noise" dominated error to Sec. 16.31 

Figure [9] shows the reduced covariance matrix, 
CovReducod(6',6'') = Cov{e,e')/Aw{e)Aw{e'), measured in 
324 mocks in Real Space at z = 0.5 — Az/(1 + z) = 0.05 
(top left panel). The corresponding model from Eq. (jlSp is 
depicted in the top right panel. Bottom panel shows the ra- 
tio of mock measurements to model. Remarkably they agree 
with each other within ~ 5% for the majority of elements, 
with the largest differences arising in areas far from the di- 
agonal where the covariance is small anyway. 

In order to make a more quantitative evaluation of our 
analytical expressions we instead plot in Fig.[Tn]four rows of 
the reduced covariance matrix, that is, the correlation be- 
tween 'w{9) and w{9') as a function of 9' (at fixed values 
of 6, as labeled in the plot). We have chosen to do this at 
our four mean redshits, z = 0.3, 0.5, 0.73, 1.1 and charac- 
teristics widths 0.15, 0.1, 0.1, 0.15 respectively (from top to 
bottom). This election refiects the fact that the calibration 
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Figure 7. Error in the Angular Correlation Function in Real Space. Predictions for Aw{9) = Cov{6, 9)^^'^ from Eq. IjlSj l using either the 
exact integration for Ci (blue sohd hne) or the Limber approximation (red solid line). Symbols arc the r.m.s dispersion in measurements 
of w(J)) in our ensembles of mock redshift bins in real space (see Sec. 14.11 and top panel of Table [TJ. Top panels show thin bins and 
bottom their wide counter-case. Notice that even as many as 100 mocks can still show sample variance fluctuations in the determination 
of the error. 



Photo z 


Space (o"^ - 


0.06) _ 


V z - 0.5 








Az/(l+z) 


0.05 - 




Az/(l+z) 


- 0.15 " 




















125 Mocka 


-° □ n 

--i ° □ □ 






;i 4 

fl[dog] 



_ 1 1 1 1 1 1 1 1 1 1 1 
\\ RedshifL 


Space (/J- 0.7) : 




^ z - 0.5 








Az/{l + z) = 0.05 - 


3 




A7/(l+7.) - 0.1 5T 


O 






X 






S a 
<i 












125 Mocks 

~ 




1 



e[dcg] 



. \ Photo — z 


-■■ Redshift Spaee _ 


"X z = 0.5 










Az/(l+z) - 0.05 - 


- °\ X 


Az/(l^z) - 0.15 " 


□\ 

\ 

□ 

□ \ 
□ \ 

° □ "\ 
□ 

125 Mocks 

< < 1 < < < < 1 < < 





1 2 


:! 4 a 
e[dng] 



Figure 8. Errors in w{8) including photo-z and redshift distortions effects. Left panel shows the r.m.s variance in measurements of w{9) 
in two top-hat redshift bins in photo-z space, both centered at (photometric) z = 0.5. The assumed photometric error is cr, = 0.06. Central 
panel shows the same quantities measured in redshift space (i.e. top-hat selection is done in true redshift). Right panel incorporates both 
effects, redshift distortions and photometric errors. The corresponding analytic predictions are given by the solid and dashed lines. In 
each panel, empty squares corresponds to a "narrow" bin (A2: ~ cr,) and filled triangles to a "broad" one (A^ ~ ^icr^). 



of photometric redshifts (mimicked here by the bin width) 
are better at intermediate redshift, and worse towards low 
and high z. Solid lines in Fig. [TD]are the predictions from 
Eq. dUll using Ct from Eq. (fB5)) . 

In all cases, the values of 9' were selected in such a 
way to span the whole angular regime where the analytical 
model for w{9) is accurate. In particular, the third from the 



left value of 9' corresponds approximately to the angular 
position of the BAO peak 

This figure reflects the high degree of correlation be- 
tween 6'-bins even when widely separated, which is charac- 
teristics of Configuration space. Nonetheless the theoretical 
estimates accounts for the whole shape, particularly close 
to the diagonal. Away from it there are wiggly features in 
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the measurements that we identify with an insufficient num- 
ber of mocks (e.g. only the upper panel with ~ 400 mocks 
does not show these features). The bottom panel shows an 
extreme case of using only ~ 40 mocks, and the associated 
level of noise. 

Figure[TT]shows the same quantity now measured in our 
mocks including photo-z {a^ ~ 0.06) and redshift distortions 
effects (P — 0.70). Here the modeling, which follows from 
Eqs. P9I20I21IB5[I Eq, works remarkably well close to the di- 
agonal. Away from it, at angular separations of 6 ~ 2 — 1.5°, 
we see that the model under-estimates the reduced covari- 
ance. This is more noticeable when the reduce covariance 
drops 0.5. This could be due to shortcomes of the theoretical 
model or to an insufficient number of mocks. Nonetheless, 
we have checked that this difference does not translate into 
different estimates of cosmological parameters when using 
the theoretical or ensemble covariance matrix. This will be 
presented in a forthcoming paper (Ross et al. 2010). 

Figure. [T^] shows instead the off-diagonal elements of 
the error matrix Cov{9, 6 -\- /S.0) for fixed value of A0, and 
as a function of 9. Top panels correspond to two bins in 
real space (as labeled) while bottom panels to our mocks in 
photo-z -f redshift space at z = 0.5 and A2:/(l + z) = 0.015 
(left) or 0.05 (right). The value of /S.9 was chosen to sample 
the full angular width of the BAO feature. In all, the model 
performs quite well, with no general or obvious systematic 
deviations. It gives accurate results close to the diagonal (i.e. 
smallest AS) and can deviate by ~ 10% — 20% for large A9 
(in a regime where covariance is noisy nonetheless). 

We finish this section comparing theory and mocks at 
the level of the Singular Value Decomposition (SVD) to il- 
lustrate how well we can model the covariance matrix in a 
different basis, one which is sometimes used to analyze real 
data. The top panel of Fig. [T3] shows the Singular values 
of the covariance matrix measured in the ensemble of 392 
mocks with z = 0.5 - A2/(l + z) = 0.15. For the SVD we 
used a range of scales [0.25° — 8°], divided in 34 bins of width 
0.3° (but results are robust for other ranges and binnings). 
The analytical model, using the linear power spectrum (solid 
blue line), describe accurately 10 to 15 singular values. We 
have tested that using more than 10 or 15 singular values the 
estimate of cosmological parameters have negligible impact 
in the outcome (see also (Eisenstein & Zaldarriaga 2001)), 
as they describe very short-range correlations (see bottom 
panel) . These results will be presented in a forthcoming pa- 
per (Cabre et al. 2010). There is however, a simple way to 
improve on the agreement. In dashed line we show the re- 
sults when using the "measured" Cz spectra in Eq. (|18|l EEI- 
This indicates that the recovery of these high singular values 
is affected by the high I tail of Ce where non-linear effects 
increase the power over their linear value. On the one hand 
this will have little importance in practical situations where 
this regime will be most probably dominated by shot-noise. 
On the other hand it can be easily modeled using fits to the 
nonlinear power spectrum, such as halof it, into the Limber 



Wc used the publicly available SpICE code (Szapudi et 
al. 2001) to measure the Ct spectra, which is particularly suitable 
to include mask effects due to partial sky coverage 
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Figure 10. Rows of the Reduced Covariance Matrix in Real 
Space. Reduced covariance between w{6) and w{6') for a fixed 
value of 8' . From top to bottom we show bin configurations with 
larger width at low and high rodshifts and smaller at intermediate 
values (resembling the characteristic performance of photo-z esti- 
mates). Solid lines are the predictions from Eq. I I18I I. Notice that 
close to 200 mocks are necessary for a robust estimation of the co- 
variance. In each panel the third from left value of 8' corresponds 
to the angular BAO scale. 
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Figure 11. Same as Fig. 9, but for measured in moclcs 

including photo-z and redshift distortions effects. Top panel cor- 
responds to a top-hat bin of width l\z/{l + z) = 0.05 centered at 
(photometric) z = 0.5, while bottom panel to A^/(l + z) = 0.15. 



formula. In the bottom panel of Fig. [13] we show the singular 
vectors corresponding to singular values 1, 3 and 6. 

So far we have not tested our model against measure- 
ments of w{9) covariance between different redshift bins. The 
full covariance matrix in this case can be easily obtained 
with the same formalism described in Sec. [3] with the full 
sky variance of spectra between bins i and j given by, 

Var(C77) = ^^[{Cl + + n,) + [Cff] (33) 

what leads to (for partial sky coverage), 

Cov;^, = ^^^P,(cose)P,(cose')[(Cl + nO 



(34) 



Here C^-' is obtained by simply replacing — >■ '^i,^'^ j^i 



in Eq. ([T9|l . where "^i is given by Eq. (|20|) with the radial 
selection corresponding to the i — th redshift bin. 



6.3 The impact of shot-noise 

Previous sections showed that the expression in Eq. (|18p can 
describe remarkably well the error due to sampling variance 
in the measurement of the angular correlation function in 
redshift bins. 

We now turn into the problem of describing the error 
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Figure 12. Off-diagonal Covariance in Real Space (Top Panels) 
and in Redshift + Photo-z Space (Bottom Panels). Panels show 
the reduced covariance matrix between w{8) and w{8 -\- AO) for 
increasing values of 9 and fixed A9. Mean red-shift and width of 
each bin configuration arc labeled in the plot. Solid lines are the 
predictions from Eq. I|18|l while symbols denote measurements in 
the corresponding mocks. 



due to shot-noise. To this end we will concentrate on differ- 
ent halo samples as tracers (from Sec. 15. 2p and test whether 
the standard Poisson shot-noise term in Eq. (|18l) can account 
for the increase in errors due to their low number density. 

For concreteness we focus on two characteristic z-bin 
configurations. Figure [14] corresponds to halos with M > 
10" Mq in the bin at z ^ 0.3 and width Az/{1 -\- z) = 
0.15. This sample has a bias b{z = 0.3) = 2.36 (see Fig.©, 
what corresponds to 6 ~ 2 if linearly evolved to 2 = 
(Fry 1996). The sample has an angular abundance of 
N/Q ~ 17 halos/deg^ what gives a shot-noise contribution, 
n/N, of order 1.8 x 10"*sr^. This corresponds to nb^ C« ~ 1 
at f ~ 200. 

Top panel of Fig. [14] shows the r.m.s. error measured 
in the ensemble of 392 mock redshift bins while lines are 
the predictions from Eq. (|18p using the exact Ce integration 
and including Poisson shot-noise (solid blue) or neglecting it 
(dashed blue). As we see the presence of shot-noise increases 
the error by 20% (in this case), but this can be very well 
modeled by the addition of the simple Poisson shot-noise 1/n 
contribution to Ce. Bottom panel shows the singular values 
of the covariance matrix SVD. Almost none of the singular 
values is well recovered when shot-noise is neglected. 

In turn, Fig. [15] corresponds to halos with M > 2 x 
10" Mq at higher redshift, in the bin 2 = 0.5- Az/(1-|- 
z) = 0.1 (the halo mass-cut was chosen slightly higher at 
higher redshift to resemble a flux-limited survey). In this 
case the bias is b{z — 0.5) = 2.93 {b ~ 2.3 at 2 = 0) with 
an angular abundance of ~ 11 halos/deg'^ {nb^ Ce ~ 1 at 
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Figure 13. Singular Value Decomposition of the Error Matrix 
measured in 392 mocks at z = 0.5 — Az/(1 + z) = 0.15. Using the 
hnear power spectrum is possible to reproduce very accurately ~ 
15 singular values (top panel) and their singular vectors (bottom 
panel). Including nonlinear clustering effects into leads to a 
good match for all singular values. The case shown corresponds 
to unbiased tracers and negligible shot-noise. 



I ~ 200). The shot-noise increases the error by ~ 30% on 
top of the samphng variance (solid vs. dashed lines in the 
top panel) and plays a crucial role in the SVD eigenvalues 
(bottom panel). For this case the SVD was done using 40 
bins of width 0.2° in the range [0.2° - 8°]. 

Remarkably, in both cases (Figs. [Til and [T5|) . the inclu- 
sion of the simple Poisson white-noise on top of the sampling 
variance in Eq. (|18|l accounts very well for all singular values 
as well as the diagonal error (solid lines). 

Arguably, the halo samples used in this test are too 
much dominated by shot-noise. We have also done the exer- 
cise of gradually increasing the relative contribution of shot- 
noise in front of sampling variance by increasingly under- 
sampling the dark matter field. As expected, the error is 
always recovered by the model just by increasing the Pois- 
son term in Eq. (|18p accordingly. 



6.4 Independence on bin-size 

When estimating a covariance matrix one important issue to 
be considered is whether the binning used for the measure- 
ments have an impact on the recovered correlations. Hence, 
in this section we aim at showing that our estimates for the 
covariance matrix are not affected by simple rebinning of 
the measurements, and so our model remains applicable in 
a general sense. 

The selection of bin size, and binning strategy (e.g. lin- 
ear vs. logarithmic), is particularly important when analyz- 
ing real data because one might be able to optimize the 
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Figure 14. The effect of shot-noise at z = 0.3. Diagonal error 
(top panel) and Singular Value Decomposition of the Covariance 
Matrix (bottom panel) for the bin z = 0.3 and Az/(l-|-2) = 0.15 
for a sample of halos with M > xlO^^ Mq. The (Poisson) 
shot-noise term increases the error by ~ 20% on top of sampling 
variance (solid and dashed lines are the model with and with- 
out the 1/n term) and is critical to recover the correct singular 
eigenvalues. We used 392 mocks for this case. 



recovered model constrains by a better split between true 
signal from noise. This is not our goal but still is appropri- 
ate to discuss our binning choices. 

In general, a too narrow bin could lead to noisy (due to 
shot-noise in the bin) and very correlated errors. Too wide 
would smooth and "diagonalize" the covariance but at the 
expense of leaving only few data points where to compare 
with the model. In turn, a logarithmic instead of linear bin- 
ning is preferably when dealing with a large dynamic range 
in scales. For the problem considered in this paper, the an- 
gular correlation at large scales, we are never beyond a fac- 
tor of 10 in dynamic range in 9, see for instance Fig. [3] In 
turn we want to sample this range uniformly, which inclined 
ourselves to linear binning. 

In addition, since we have a feature in our signal that 
we want to resolve (the BAO bump) we need several bins 
across it which sets our maximum bin-size. We then chose 
to have ~ 5 — 10 bins across it which led to A9 = 0.3° at 
z = 0.3, AO = 0.2° at z = 0.5 - 0.73 and AO = 0.1° at 
z = 1.1. Still, we could have selected thinner bins. 

Therefore, in Fig. [16] we show the measured diagonal 
error and the reduced covariance (i.e. same as Figs. [7] and 
I10|) for the mock configuration centered at z = 0.73 (in real 
space) for a bin-size half the one used throughout the paper. 
Top two panels correspond to the diagonal error while bot- 
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Figure 15. The effect of shot-noise at z = 0.5. Diagonal error 
(top panel) and Singular Value Decomposition of the Covariance 
Matrix (bottom panel) for the bin 2 = 0.5 and Az/(1 + z) = 0.1 
for halos with J\/ > 2 X lO^'' h^^ Mq. Conclusions shared with 
Fie. 1141 Wc used 175 mocks for this test. 



torn to rows of the reduced covariance matrix. Both set of 
measurements agree with each other. The ones with smaller 
A6 seem slightly above in diagonal error and below in re- 
duced covariance. But, as expected, they also show more 
noisy. In all, our model (shown in solid lines) describes both 
equally well. 



7 CONCLUSIONS 

The field of large scale cosmological structure will undergo 
an unprecedented era in the immediate future with several 
large observational campaigns proposed or under implemen- 
tation. Many of these surveys, such as DES, PanStarrs and 
LSST, will use photometric techniques to estimate the radial 
position of galaxies instead of measuring their full spectra, 
which is more time-demanding . This gain allows to sur- 
vey wider areas and fainter objects, but at the exchange of 
increasing the uncertainties in the true redshifts and degrad- 
ing the radial clustering amplitude. The proposal is then to 
split the data into redshift bins and exploit the information 
available from angular correlation functions, provided with 
an accurate determination of the measurement errors. Yet, 
this goal can only be accomplished if we develop accurate 
models for the signal and its errors that take into account all 
relevant efTects and are robustly tested in realistic scenarios. 
In this paper we addresseded this issue in a comprehensive 
way. 
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Figure 16. Independence on angular bin-size. The figure shows 
how the diagonal error (top two panels) and reduced covariance 
(bottom panel) vary when one halves the angular bin-size. Recov- 
ered correlations becomes noiser but no appreciable systematic 
diff'erence with the model is found. 



We first developed an extensive set of mock catalogues 
(in the form of redshift bins) reproducing the angular cov- 
erage and radial distribution of a photometric survey like 
DES. We did it in increasing steps of realism, first in real 
space, then including redshift distortions or photometric er- 
rors and finally altogether. For this we used a large N-body 
simulation (of ~ 450 Gpc"^ simulated volume) provided 
by the MICE collaboration ( |http : //www . ice . cat /mice p . 
These mocks can be regarded as independent realizations 
as their volume overlap is minimal and therefore provide a 
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unique statistical framework for model testing (see Table [T]). 
We claim they are also equivalent to a light-cone analysis as 
we are doing narrow redshift bins, which involve negligible 
evolution. 

We next put forward a model for the angular correlation 
function w{9) accounting for all the relevant effects, namely 
bin projection, nonlinear gravitational evolution, linear bias, 
redshift space distortions and photo-z errors. 

An exhaustive comparison of our model for 'w{6) against 
the mock measurements showed a remarkably good agree- 
ment for a wide range of 6 and scenario (real space, photo-z, 
RSD and photo-z + RSD), validating the treatment of the 
different effects and opening the door to the use of this probe 
for real data analysis. Nonlinear gravitational evolution pro- 
duces minor distortions in the correlation pattern after the 
bin projection. In turn, analysis of halo angular clustering 
showed a very good consistency with a linear bias assump- 
tion. The interplay of photo-z and redshift distortions is the 
most important consideration regarding the shape of «;(6'). 

Redshift space distortions introduces a large and scale 
dependent enhancement of w{6), that can reach a factor of a 
few at BAO scales (see Fig. [5]). For our widest bin (where the 
effect should be least important) it still rises the amplitude 
of w{9) by ~ 50% at 9b AO (with respect to real space). 
Conversely, photo-z effects lower the clustering amplitude 
by extending the effective bin projection. For example, for 
the widest bin mentioned before (with Az ~ 40-^) we find 
that the two effects counter-act each other at 6b AO, but 
leave a scale dependent signal towards smaller angles. For 
narrower bins photo-z dominates, but redshift distortions is 
certainly not negligible. These trends were concluded from 
both, our photo-z -I- RSD mocks, and the analytical model. 

In turn, we showed that the Limber approximation 
should not be used in precision analysis of large scale clus- 
tering as it leads to the incorrect shape of w{9) in the full 
range of interesting scales, and severely misestimates the 
amplitude of the Ci spectra for £ ^ 40 — 50. This is convinc- 
ingly shown in Figs. IBll IB2I and IB3I Another interesting 
issue considered was the impact of uncertainties in the true 
redshift distribution of objects. This showed an important 
aspect since it can lead to several percent changes in both 
the shape of w{9) and the BAO peak position given the 
accuracy of present photo-z estimators. 

We would like to highlight that, in the process of de- 
scribing 'w{d), we have also investigated a model for the 3- 
d matter correlation function that is able to reproduce the 
clustering signal in a broad range of scales and redshifts with 
only 2 parameters. This, discussed in detail in Appendix [Xl 
can be of grand interest for future spectroscopic surveys such 
as BOSS, Hetdex and WiggleZ. 

We have made an equally exhaustive effort in modeling 
and testing the full error matrix characterizing the mea- 
surements of 'w{d). The covariance matrix is often estimated 
from the data itself, using internal or re-sampling methods 
such as Jack-knife or bootstraping. However, their limitation 
is still a matter of some debate (Norberg et al. 2009). Hav- 
ing a full theoretical model is thus very suitable for present 
and future analysis. 

We took into account partial sky coverage by assuming 



that Coy{9,9') scales as /J^^. The full sky situation is then 
easily treated by translating errors from harmonic space, 
where the covariance matrix is diagonal and proportional to 
Ct. Through the angular power spectra we included the same 
effects considered for w{9) into Coy{9,9') (photo-z, redshift 
distortions, bias) for a typical survey with fsky = 1/8. 

Our modeling of errors recovers the correct variance in 
TO (6) as measured in the mocks for a wide range of bin config- 
urations , from low to high redshift {z ~ 0.3— 1.1) and from 
thin to thick bins (100/i^^Mpc - 550/i^^Mpc). And this 
conclusion extend to the more realistic cases where we in- 
cluded photo-z effects and redshift space distortions. In addi- 
tion we used different halo samples to study cases where the 
shot-noise was comparable or larger than the sample vari- 
ance component of the error. This regime was also nicely 
described analytically by adding a standard Poisson shot- 
noise contribution to the variance of the Ci spectra. 

Moreover, and thanks to the large number of mocks con- 
structed, we measured the full covariance matrix with high 
precision in different configurations. We find that at least 
150 — 200 mocks are necessary for a well defined reduced 
covariance, but this is discussed more properly in Cabre et 
al. 2010. Remarkably in all cases tested, the modeling re- 
covers very accurately the true error matrix. 

In a parallel line of research we have tested the recovery 
of cosmological parameters using our model for 'w{9) and the 
theoretical expression of the covariance matrix. And com- 
pared this with the same analysis but using the true covari- 
ance as measured in the mock ensembles. Indeed, the best- 
fit values and errors contours coincide for both approaches 
(Cabre et al. 2010) giving very encouraging prospects for 
the use of our analytical expressions in real data analysis or 
in realistic forecasts of upcoming photometric surveys. 
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APPENDIX A: SPATIAL CLUSTERING 

In Sec. 12.21 we presented a parametric model for the spatial 
correlation function and argued that a single set of best-fit 
parameters could be used to describe the clustering from low 
to high redshift. In this appendix we show how this model 
performs against measurements of 3-d clustering in N-body 
simulations, in particular as a function of redshift. 



AO.l Nonlinear Gravitational Clustering 

The top panel of Fig. lAll shows the spatial correlation func- 
tion measured in MICE7680 at z — 0.3 compared with the 
parametric model given in Eq. (|10p for best-fit parameters 
Sbao = 5.54 Mpc and Amc ~ 1.55. To simplify the mea- 
surement we only used a cubic sub- volume of the full comov- 
ing output, of length Lbox ~ 2560 Mpc. Error bars were 
obtained from the scatter among 125 Jack-knife volumes of 
the full box. We see that the model performs very well down 
to scales ~ 20 Mpc at this redshift. Notice how the con- 
volution of linear theory with a Gaussian smoothing (dot 
dashed line) leads to a slight increase in amplitude above 
the measurements for r 60 /i^^ Mpc. Measurements follow 
linear theory at these scales. The effect of the mode-coupling 
term in Eqs. (|8ll0p is mainly to correct for this mismatch. 

From theoretical grounds we expect the smoothing 
length to be determined by the amplitude of large-scale ve- 
locity flows and therefore given by (Bharadwaj 1996; Crocce 
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Figure Al. Spatial Correlation Function: measured in the z = 
0.3 comoving output of a large volume N-body run {MICE7680) 
compared with the parametric model given in Eq. ^ with best- 
fit parameters s^qq = 5.54 Mpc and Amc = 1-55 (Top panel). 
The length scale used in the y-axis is rsAO = 110 fe"^ Mpc. Bot- 
tom panels display the ratio of the measured correlation function 
(at the given redshift) to our model correlation, obtained after 
scaling with redshift the z = 0.3 best-fit values. 



and Scoccimarro 2006b; Eisenstein et al. 2007; Matsub- 
ara 2008), 



ao ~ [(1/3) / d'qPLin(g)/g']'/' = 5.42 Mpc h-^ (Al) 



Sba 



which is within 3% agreement with the recovered best-fit 
value. In turn, the lowest order estimation for Amc using 
perturbation theory yields 34/21 ~ 1.62 independently of 
cosmology and redshift (Crocce and Scoccimarro 2008), 
which is also very close to our best-fit value 1.55. 

Hence, it is possible to use these theoretical estimates 
as a starting point in the modeling instead of using fitting 
parameters. Notice however that, if one extrapolates this 
model to describe the clustering of tracers, the actual val- 
ues for Sbao and Amc might depend on the particular tracer 
under study (e.g. on halo mass) (Sanchez et al. 2008). In 
addition, the rather large error bars obtained from present 
day data do not put severe constraints on the values of S(,ao 
and Amc (Sanchez et al. 2009; Percival et al. 2010). 

As discussed in Sec. l2.2l one interesting aspect of a the- 
oretical model when it comes to analyze data split into con- 



tinuous redshift bins is the possibility to understand the red- 
shift evolution of the nuisance parameters involved, which 
in turn allows one to fit the least number of parameters (i.e. 
just at one fiducial redshift). With this approach in mind 
we now show that our model can indeed describe the 3-d 
clustering from low to high redshift using a single set of 
best-fit parameters. And this property will remain after the 
projection into angular correlations in Eq. ((5]). 

The bottom panel of Fig. lAll shows our model in 
Eq. (jlOp against measurements of ^ in the comoving out- 
puts of MICE7680 at z = 0.5, 0.73 and 1.1. The values for 
Sbao and Amc were taken from a best-fit analysis to ^(r) at 
z = 0.3. In all cases the agreement is very good, similar to 
that at z = 0.3. 

Notice that conclusion in this appendix relate to 3-d 
clustering in general, regardless of photo-z, hence are also 
relevant for spectroscopic surveys such as BOSS0, Hetdex 
and WiggleZ 0. 



APPENDIX B: THE LIMBER 
APPROXIMATION 



Bl Ce power spectrum 
Limber formula 



exact evaluation and 



In this appendix we discuss the way we implement the nu- 
merical integrations that lead to the exact Ce spectra in 
Eqs. (|19I20|I . and how this exact result compares with the 
widely used Limber approximation (Limber 1953). 

Let us first recall the derivation of the exact expression 
of the angular power spectrum in terms of the spatial one. 
This is done by expanding the density field in Eq. ([1} in 
Fourier Series and subsequently expanding the plane wave 
into spherical harmonics 0- After some straightforward ma- 
nipulation this leads to, 



atm = 47ri / dz (t>{z) 



5(k,z)j,(fcr(z))F;^(k), 



(27r)3 

(Bl) 

where ji are the spherical Bessel functions of order I. In- 
serting Eq. HB1|) into Eq. 1)13^ and using the orthogonality 
relation of the spherical harmonics gives Eqs. (|19I20|I . 



1 

2^ 



A-Kk^dkP{k)-^t{k) 



dz(t>{z)D{z)Mkr{z)) 



These integral expressions are numerically expensive to 
compute due to the oscillatory behavior of the spherical 
Bessel functions ji{x) for a: 3> 1. It is then desirable to seek 
for ways to improve their convergence. The most popular 
short-cut to evaluate Eqs. (|19I20|) is the so-called Limber 
approximation (Limber 1953; Kaiser 1992; Kaiser 1998) 



13 'http : //www . sdss3 . org/ 1 

http://hetdex.org7] 
1^ http://wigglez.swin.6du.au/ 
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that follows from the orthogonality relation of the spherical 
Bessel functions (e.g. see Loverde and Afshordi 2008), 



dkk ji{kri)ji{kr2)P{k) 



(B2) 



2 H 



P(k : 



1 + 1/2, 



that leads to the well-known expression, 



dz(t>\z)D\z)P{{e + l/2)/r{z)) 



AM{z) 
r{zY 



(B3) 

Strictly speaking this is valid for £ ^ 1 (that is, small an- 
gles), but in practice it coincides with the exact expression 
already by £ ~ 50 — 100 as discussed in Fig. IBll (see also 
Blake et al. 2007; Loverde and Afshordi 2008). Therefore 
we proceed as follows. First, let us concentrate on linear 
theory fluctuations. Then, inspired by Eq. ((8| (and Crocce 
and Scoccimarro 2006b; Eisenstein et al. 2007; Matsub- 
ara 2008) we decompose the linear power spectrum as 



P{k) = Pun{k)G{k) + PLin(fc)(l - G(fc)), 



(B4) 



with G = exp(— fc^Sjjjo) a-nd Sbao taken from Eq. ()A1|I . The 
first term dominates at large scales and accounts for the 
degradation of BAO bump, leading also to the correct shape 
of the 3-d correlation function in a broad range of scales 
(Fig |Al[l . Since this is our regime of interest we evaluate this 
piece using the exact integration, that can now be integrated 
up to an upper bound such that fcmax Sbao ~ 4 — 5 where 
Pliii exp(— fc^s^jjo) is already suppressed by several orders 



of magnitude. The second term in Eq. (|B4p is non-negligible 
only at high-^ therefore it can be computed using the Limber 
formula, Eq. (|B3|) . In all, we evaluate the exact asFI 

Cl = C,.Exact(PLin X G) + C^Limbcr (^Lin X (1 - G)) (B5) 

In the top panel of Fig. IBll we show the average Ci 
power spectrum measured in 324 real space mocks centered 
a,t z = 0.5 with Az/(1 + z) = 0.05 (using SpICE, see Sza- 
pudi et al. 2001), and compare it with the exact integration 
(solid blue line) and the Limber evaluation (solid red line) 
of the first term of Eq. (|B4[I . Clearly, the Limber approx- 
imation fails by a large factor at low-/, which is in broad 
agreement with the literature (Blake et al. 2007; Loverde 
and Afshordi 2008). However, by £ ~ 60 it fully coincides 
with the exact computation. The contribution of the second 
term in Eq. (|B4|I is only relevant for I > 100 (dashed black 
line) and therefore it is safety computed using the Limber 
approximation. We have checked that these conclusions re- 
main true for different z and Az. Bottom panel of Fig. IBll 
shows the full Gf-Limber (red plus dashed lines in top panel) 
and the measured spectra in ratio with our model for the ex- 
act Ct spectra, given by Eq. (|B5P (i.e. blue plus dashed in 
top panel). 

Notice that we have only attempted to evaluate the 
linear spectrum. Nonlinear effects can be taken into ac- 
count by replacing Pliu in Eq. (|B4|I by fits to nonlinear 




Figure Bl. Top Panel: Angular Power Spectrum measured in 
324 Real Space mock catalogues of mean redshift S = 0.5 and 
width A.z/{l + z) = 0.05, compared with our theoretical estimate, 
which we brake into to additive contributions (Eg. lBSII . \ow-l from 
the exact integration in Eq. I|19|l (solid blue lino) and large-£ from 
the Limber approximation in Eq. | |B3| | (dashed black line). Solid 
red line shows the \ow-t term computed with the Limber approx- 
imation, which leads to a severe under-estimation of the large 
angle power (Z < 30). This could critically impact the compu- 
tation of w{d) and its covariance at large BAO scales. Bottom 
Panel: Ratio of the full C^-Limber (red line) and the measure- 
ments to our estimation of the exact (linear theory) spectra. 
Displayed error bars correspond to the ensemble r.m.s variance. 
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Figure B2. Angular Correlation Function measured in the 324 
mock bins compared with the theoretical predictions obtained 
from Eq. Q (dashed line) , or from C^-space through the Legendre 
polynomials and Eq. II14I I. Using the Limber approximation to 
compute the leads an incorrect shape for w{d) (solid red line). 
In turn, the exact Ci integration agrees as expected with Eq. JSjl 
(solid blue line). Displayed error bars correspond to the ensemble 



r.m.s variance. 



The difference between our approach and a straightforward 
use of the exact integration for all the I range is under 1%. 
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power spectrum, such as halofit, or exact analytical ex- 
pressions (Crocce and Scoccimarro 2006a; Matarrese & 
Pietroni 2008; Matsubara 2008; Taruya et al. 2009). How- 
ever we have found that for our purposes this gave no ad- 
ditional contributions to the error estimations. In fact, the 
second term in Eq. (|B5|) had a negligible impact in the es- 
timate of Aw{9) for all cases we had considered. It only 
gave a contribution when evaluating the covariance matrix 
in cases where the shot-noise of the sample under consider- 
ation is fully negligible (see Sec. 16. 3p . We finish by noting 
that linear redshift space distortions fully cancel for large I 
(Padmanabhan et al. 2007), therefore they only need to be 
accounted for in the first term of Eq. HB5|I (following Eq. l2H) . 



B2 Correlation Function 
Redshift Space 



results in Real and 



The inaccuracy of the Limber approximation also affects the 
calculations in configuration space. For instance, the way 
in which the disagreement in Fig IBll translates into Real 
Configuration Space is depicted in Fig. IB2I Solid red and 
solid blue lines were obtained using Eq. (|14p and d from the 
Limber or the Exact evaluation respectively (those shown in 
top panel of Fig. IBlf) . Dashed line follows from projecting 
the corresponding 3-d correlation function, as in Eq. ([S}. 
Here, the failure of the Limber approximation to describe 
the BAO bump is more evident. The exact d integration 
yields the same result as the one from Eq. ([5]), as expected. 

These limitations are even more severe when redshift 
distortions are taken into account. Without lost of generality 
we will use a top-hat selection in the following equations to 
illustrate the problem: 



1 /-Xii 



dxi 



dx2 £.(ri,r2) 



(B6) 



where A;^ = Xmax — Xmin- The correlation 5(ri,r2) is only 
a function of the relative separation between ri and r2. Be- 
cause of redshift distortions, this is in fact a function of n 
and a, the light-of-sight and transverse separation. For small 
angles and distance observer, i.e. the Limber approximation, 
we can take tt — X2 ~ Xi ^-nd = XiXa^^ LJ- We can then 
change variables in the above integrals from xi and X2 to n 
and a 



w{e) = 



da 



div ^(tt, a) 



(B7) 



This result is valid both in real and redshift space. In 
real space, ^{n, a) is replaced by the isotropic correlation 
^(r) with — tv^ -\- . In redshift space we use the linear 
theory prediction (Kaiser 1987; Hamilton 1992): 



to the line-of-sight (l.o.s) and are the multi-poles of the 
correlation function in terms of Legendre polynomials Pt, 

2£+l 



6(s) 



^{n,a)Pe{fi)dfi, 



(B9) 



with s = y/a^ + tt'^ and fi is the cosine angle with the l.o.s. 
For the Kaiser model one has (Hamilton 1992), 



Us) = b — 



as) + l^s) - llis) 



(BIO) 
(Bll) 
(B12) 



where b is the bias of the sample (assumed linear and local), 
P — f/b, f = din D/ din a is the growth rate factor and 



U) = / U')r'^dr', 



U) 



' (,{T')r'^dr' . 



(B13) 
(B14) 



We will show next that Eq. (|B7[l turns out to be a bad 
approximation to model BAO scales. But it provides a good 
way to illustrate why redshift space distortions are so impor- 
tant for the BAO detection for small (to moderate) photo-z 
bin widths A^. For large values of A;^ the integral repro- 
duces the real space correlation (because the total number 
of pairs are preserved by redshift space distortions), while 
for A;^ < 500/i~^Mpc the tt integral is truncated by the 
radial boundary of the top-hat window. Line-of-sight pairs 
separated by tt > A^ do not enter in the redshift bin and 
are therefore not integrated. Smaller pairs are also affected 
because many of them are missing at the boundaries. This 
missing pairs produce a distortion in the measured w{9) as 
compared to real space correlation (see Fisher et al. 1994, 
Padmanabhan et al. 2007 and Nock et al. 2010). 

Both real and redshift results are quite inaccurate under 
the Limber approximations. This is illustrated in Fig. [13 
which compares the calculation of the above integral in real 
and redshift space in the Limber approximation (dashed 
lines) with the corresponding exact results (continuous line) 
by integrating Eq. ()B6|I . Note how the results in real space 
are very similar to the corresponding calculation in Fig. IB21 
based on power spectrum calculation. The results in red- 
shift space for the Limber approximations show even larger 
deviations than in real space. The BAO peak, which shows 
around 6 ~ 4° is clearly distorted by the Limber approxi- 
mation. We conclude that the Limber approximation is not 
good enough for precision BAO modeling and we use the 
exact integration throughout this paper. 



^(a, tt) = Us)PoM + Us)P2M + Us)P4M (B8) 
where tt and a represent the separation along and transverse 

For a non-flat cosmology we need to replace x by the comoving 
angular diameter -D(x) distance here: = D{xi)D(x2)9^ 



APPENDIX C: 
SPACE 



ERRORS IN HARMONIC 



In this appendix we comment briefly on the correlation be- 
tween different modes in Harmonic Space induced by a par- 
tial sky coverage. 
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Figure B3. Angular correlation in the Limber approximation 
(dashed line) in Eq. IIB7I I in both real (bottom dashed line) and 
redshift space (top dashed line). Continuous lines show the cor- 
responding exact calculations using Eq. l(B6j. 



Equation (|18|) can be naively interpreted as if, in the 
presence of a partial sky coverage, the errors in Ci increase 
by a factor 1/ \/J7ky, that is 



(AGO' 



fsky 2-^ H~ 1 



(Cl) 



with the co-variance matrix remaining diagonal. In reality, 
the presence of boundaries decreases the diagonal error but 
it also introduces co- variance between different £ modes (e.g. 
Cabre et al. 2007 and references therein). 

This is clearly depicted in Figure [CT] where we show the 
error in Harmonic space from Ci measurements using 392 
mocks with z = 0.5 and Az/(1 + z) = 0.1, and compared 
them with the expression above. Top panel corresponds to 
the measured diagonal error, which indeed is much smaller 
than that from the naive interpretation in Eq. (IClf) given by 
the dashed line (even smaller than the full-sky limit shown 
in solid line). Bottom panel, corresponding to Cov{£,£') for 
fixed values of shows the emergence of co- variance be- 
tween different modes distributed in a range of £ values ap- 
proximately given by ±l/fsky In all, the "total error" is 
larger than in full-sky. 

This conclusion can be naively founded from the fact 
that for a given value of £' the integral. 



d£ Coy {£',£) 



fsky{2l' + l) 



(C2) 



what leads to the simple interpretation that, when the sur- 
vey is reduced from a full-sky limit by a fraction fsky, the 
diagonal covariance rises from its value 2/(2/-|-l)bya fac- 
tor 1/faky but then "leaks" towards other £ modes result- 
ing in a non-diagonal error matrix. The final diagonal error 
is smaller than its "full-sky" value but is the off-diagonal 
elements of Cov(^,/) what determine Coy{0,0'). Yet, we 
circumvent the problem of computing non-diagonal compo- 
nents of Cov(^, £') by assuming the scaling of the covariance 
in Configuration space as 1/ fsky 
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Figure Cl. Correlations of Cl spectra induced by partial sky 
coverage. The total error budget in a partial sky survey increases 
(roughly by a factor 1/ fsky) compared to the full sky case. The 
covariance matrix is no longer diagonal (bottom panel) with the 
variance error smaller than its full sky value, by 30% in this par- 
ticular case, as shown in the top panel (see text for details). 



A standard way to overcome the complex covariance in 
Harmonic Space is to bin the measured Ci spectra in such 
a way to make the covariance matrix block-diagonal (e.g. 
Cabre et al. 2007). A simple rule of thumb discussed in 
Cabre et al. 2007 is to choose A£ fsky ~ 2, which is in very 
nice agreement with the width of the Cov(^,^') distribu- 
tion in Fig. ICll It remains to be studied whether this have 
an impact in methods like BAO where one is after short- 
wavelength features on top of the broad band Ci shape. 
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